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Abstract 

We introduce a new family of codes, termed asymmetric Lee distance (ALD) codes, designed to correct errors arising in DNA- 
based storage systems and systems with parallel string transmission protocols. ALD codes are defined over a quaternary alphabet 
and analyzed in this particular setting, but the derived results hold for other alphabet sizes as well. Our technical contributions are 
two-fold: First, we derive upper bounds on the size of the codes under the ALD metric based on linear programming techniques. 
Second, we propose a number of code constructions which imply lower bounds. 
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I. Introduction 

Codes for classical symmetric channels with single-sequence inputs and single-sequence outputs have been extensively studied 
in the past and a number of diverse solutions such as algebraic codes fTS) , codes on graphs - e.g., LDPC codes GZI - and 
polar codes | fT9| have been proposed to address the underlying communication problem. Similar advances have been reported 
for parallel channels |j^, under the rather common assumption that the individual channels introduce uncorrelated errors, as 
well as for channels with asymmetric error properties GD- In both scenarios, the alphabet size of the codes is restricted by the 
system design, and often, input sequences are de-interleaved or represented as arrays over smaller alphabets in order to enable 
more efficient transmission. Far less is known about channels that operate on several sequences at the same time and introduce 
correlated and asymmetric symbol errors. The goal of this work is to introduce and analyze one such scenario, motivated by an 
abstraction of emerging read channels used for DNA-based storage systems GD^ GD- 

DNA-based storage systems have recently been implemented using biochemical methods for synthesizing (writing) the desired 
digital content in DNA 0, Q, | |22) , and reading the content via DNA sequencing systems. DNA synthesis may be performed 
with small error rates, the dominant error events being simple substitutions The read process introduces errors that cannot 
be modeled using standard information-theoretic channels. For example, the high throughput Illumina sequencing platform may 
be viewed as a channel that has a single sequence as its input and a multiset of the input sequence substrings as its output GD- 
On the other hand, fast evolving, miniaturized nanopore sequencers such as MinlON G3 introduce context-dependent, bursty 
deletion errors. 

The goal of this work is to propose designs for error-correcting codes capable of handling “asymmetric” errors arising in new 
solid state nanopore sequencing systems. In these systems, the DNA strings are read one symbol (nucleotide) at the time, and 
the identity of the symbol is determined by the drop of an otherwise fixed current caused by the nucleotide passing through the 
sequencing pore 0, G§- Since nucleotides have different atomic structures, they lead to different current drop responses. A 
stylized depiction of one such system response to the four nucleotides A, T, G, C is shown in Figure 0 



Fig. 1. Nanopore sequencers: detection system output distribution {y-a.xis) versus current drop in nA (fc-axis), along with a mapping of binary words of length 
two to the DNA alphabet, as adopted in our analysis. 

This work was supported in part by the NSF STC Class 2010 CCF 0939370 grant and the Strategic Research Initiative (SRI) Grant conferred by the University of 
Illinois, Urbana-Champaign. Parts of the results were presented at the Information Theory Workshop (ITW) in Jerusalem, 2015, and the International Symposium 
on Information Theory (ISIT), 2015. 



2 



(0;1) C 


Fig. 2. A weighted confusion graph for the code alphabet {A, T, G, C}, involving n = 1 symbols. The symbols are encoded as pairs of bits. 

As may be seen from Figure the impulse response distributions corresponding to T and C exhibit significant overlap, making 
the symbols highly confusable. In contrast, the impulse responses corresponding to A and G are widely separated and are very 
unlikely to be confused. The pairs of nucleotides G and T and A and C are only slightly confusable. Using a sequencer with the 
given symbol response would lead to asymmetric errors, in so far that one would have high incidence rates of T-C substitution 
errors, a significantly smaller incidence rate of G-T and A-C substitution errors, and a very low incidence rate of A-G substitution 
errors. 

To abstract this error model, we introduce an asymmetric confusion graph shown in Figure There, we used the following 
binary encoding of the DNA alphabet: G —>■ (0; 0), A —(1; 1), C —>■ (0; 1) and T —^ (1; 0). The labels of the edges of the confusion 
graph depend on a positive integer parameter A and are chosen to have a simple form that inversely correlates with the probability 
of the corresponding substitution erro|^ As an example, the pairs of bits (0; 0) and (1; 0) are twice more likely to be confused 
than the pairs of bits (0;0) and (1; 1). Hence, the distance “induced” by the former pair is half the distance “induced” by the 
latter pair. Clearly, one could adopt a significantly more general model for the labels of the confusion graph which more precisely 
captures the actual errors in the sequencing devise; but such a model would introduce unnecessary notational inconvenience and 
lead to tedious calculations that obscure the actual ideas behind the analysis. 

The confusion graph in Figure has another interpretation in terms of a parallel channel model, interesting in its own right. 
To motivate this interpretation, consider Figure which depicts a transmission model where two binary input sequences are 
simultaneously passed through two channels that introduce substitution errors. Simultaneous errors in both strings are less likely 
than individual string errors. In addition to the substitution errors, the outputs of the channels may be switched - in other words, 
the label of the channel from which the output symbol originated may be in error. The confusion graph for this type of channel is 
also captured by the model of Figure]^ where the vertices are indexed by pairs of bits denoting the inputs into the two channels. 
Such switching errors may arise due to loss of synchronization in de-interleaving or due to the lack of channel side information. 

The central problems of the presented work are to evaluate the ultimate coding limits under the distance induced by the symbol 
metric of Figure and to describe combinatorial constructions for pairs of binary sequences (codewords) that are at a sufficiently 
large “asymmetric Lee distance” (ALD) from each other. As discussed in the sections to follow, the ALD equals a scaled version 
of the Lee distance with an additional modification that two symbols are deemed to be of higher importance than the other two 
symbols. The two distinguished symbols capture the uncertainty about the actual ordering of the readouts in the parallel channel 
or the fact that two nucleotide impulse responses tend to show a high degree of overlap. The ALD was not previously studied 
in the literature, although its relationship with the Lee metric allows one to adapt certain proof methods 0 used for the latter 
distance to the case of the ALD. Nevertheless, the asymmetry of the ALD also calls for developing new, specialized coding 
techniques that enable deriving tight bounds on the code size and constructing codes that approach these bounds. 

The contributions of this paper are three-fold. First, we introduce a new channel model and the corresponding distance measure 
that has applications in macromolecular storage. Second, we derive upper bounds on the size of codebooks under the newly 
introduced distance using recently proposed linear programming (LP) approaches on hypergraphs m and LP methods based on 
alphabet extensions and Delsarte-like techniques 0 - Third, we describe a number of linear and nonlinear code constructions and 
accompanying decoding procedures. 

The paper is organized as follows. Section contains a formal definition of the ALD. Section III describes how to use linear 
LP methods on hypergraphs in order to derive non-asymptotic upper bounds on codes in the ALD. Section IV introduces a new 


LP framework that uses specialized alphabet embeddings. Sections and VI present new code constructions for the ALD. 


11. Preliminaries and Problem Formulation 

For a,b G F 2 , the indicator function l(a, 6) takes the value l(a, 6) = 1 if a = b, and zero otherwise. Similarly, an n-variate 
indicator function for a collection of n symbols oi,..., o„ over F 2 takes the value l(ai,..., a„) = 1 if oi = 02 = ... = a„, 
and zero otherwise. As an illustration, 1(0, 0,0) = 1, while 1(0,1,1) = 0. 

Consider next four sequences a = (oi,..., a„), b = (bi,..., bn), c = (ci,..., c„), d = (di ,..., d„) S F^, paired as (a; b) 
and (c; d). The pairing of two binary sequences of length n results in one sequence over a quaternary alphabet as illustrated by 
the following example ((0,1, 0); (1,1,1)) = ((0; 1), (1; 1), (0; 1)) where ((0; 1), (1; 1), (0; 1)) G (Fl)^. 

^Note that one can choose different functions of A for the edge labels, as long as the resulting distance d satisfies d((0; 0), (1; 1)) < d((0; 0), (1; 0)) < 
d({0; 0), (1; 1)). The proposed functions A, A + 1, 2(A + 1) were chosen for simplicity of analysis and based on the fact that for A = 1, they produce weights 
that are powers of two, namely 1, 2, 4. 
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Fig. 3. A pair of channels with individual substitution en'ors, and outputs which may be switched. In the given example, the outputs of the channels at position 
three are switched. 


Fig. 4. A weighted confusion graph for the Lee distance 



For a positive integer A, the ALD d\{{a; b), (c; d)) between two pairs of binary sequences (a; b), (c; d) is defined as 

n 

dx{{a\b), {c]d)) =X! (^ + (l(aj,6i) + l{ci,dt)) + (1) 

A)1 (^di , bi^ Cj, , 

where d denotes the complement of the bit a. 

The confusion graph for a code of length n = 1 under the ALD matches the one depicted in Figure For example, it is easy 
to see that ^^(( 0 ; 0 ), ( 1 ; 1 )) equals 


(1 + A) (1(0,0) + 1(1,1)) + A1(0,1,0,1) - 2(1 + A)1(0,0,1,1) 
= (1 + A)(l + 1) + A 0 — 2(1 + A) 0 = 2(1 + A). 


It can be verified that d\{{a] b), (c; d)) is a metric with respect to the pairs of binary sequences in the argument; dx is non¬ 
negative and symmetric, and it satisfies the triangle inequality. From 0 and the confusion graph, we also observe that the ALD 
is asymmetric in terms of the costs of different transitions, and in particular, complementary pairs are treated differently than 
non-complementary pairs. Furthermore, when pairs are complementary, the distance depends on the binary weight of the pairs. 
The choice of the parameter A governs the degree of asymmetry. 

In order to highlight the relationship between dx{{a; b), (c; d)) and the Lee distance, let us first recall that the Lee distance 
between two symbols a, b over Z 4 is defined as c?L(a, b) = min(|a — 6 |, 4— |a — &|). Next, let us introduce a mapping Z : —>■ Z 4 

such that (0; 0) —>■ 1, (1; 0) —0, (0; 1) —>■ 2 and (1; 1) 3. By changing the weight between (1; 0) and (0; 1) from A to 2(1 -f A), 

we arrive at a scaled version of the Lee distance ^^(a, b) between two symbols a, 6 S Z 4 , i.e.. 


(1 + A) • min{4 — |a — b\, |o — b\} = (1 + A) • d^la, b). 


( 2 ) 


More precisely, for two sequences Zi = (a; b),Z 2 = (c; d) viewed over Z", we have 


dx{zi, Z2) = {I + X) ■ dL{zi, Z2)- (3) 

(2 + A) 5 : (1 - l(zi^i, Z2,i)) < (1 -f A) • dL{zi,Z2). 

{zi,i>22.i}={0,2} 

Simple inspection of the transition diagram for the ALD also reveals that 

dx{zi,Z 2 ) > ^dL{zi,Z 2 ). 

Furthermore, the Lee distance over Z4 equals the Hamming distance over Z^ under Gray mappings from F| and Z4. Hence, 
since Z is a Gray map, we have 

dxizi,Z2) ^ -dH{Zl,Z2). 

Note that the distance between pairs of symbols {a;b), {c;d) G F| under the ALD depends on the exact values of the bits in 
(a; 5) and (c; d) and not on their sum. For example, if A = 1, (a; 5) = (0; 0) and (c; d) = (1; 1) then dx{{a\b), {c;d)) = 4; on 
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the other hand, for the same choice of the parameter A and for (a; b) = (1; 0) and (c; d) = (0; 1), we have dx{{a', b), (c; d)) = 1, 
despite the fact that in both cases we have (a; b) + (c; d) = {a + c',b + d) = (1; 1). At the same time, for both choices of paired 
vectors, the underlying Lee distance equals two, as the Lee distance for the given mapping Z only depends on the Hamming 
weight of (a; 6) + (c;(i). The connection between the ALD, Lee metric, and asymmetric error-correcting codes will be used in 
some of our subsequent derivations. 

In what follows, we refer to an error that causes a transition between (1; 0) and (0; 1) as a Class 1 error, similarly, we refer to 
an error that causes a single substitution in one of the input strings as a Class 2 error. An error that causes a transition between 
(0; 0) and (1; 1) is referred to as a Class 3 error. Note that based on Figure]^ an edge in the confusion graph corresponding to 
a Class 1 error has weight A, an edge corresponding to a Class 2 error has weight 1 -f A, while an edge corresponding to a Class 
3 error has weight 2(1 -f A). 

Let d be a positive integer. We say that two pairs of sequences (a;b),(c;d) G F 2 x F 2 are (d,\)-distinguishable if their 
ALD d\ is at least d; conversely, we say that two pairs of sequences are (d, X)-indistinguishable if their ALD is less than d. 
Furthermore, for an ordered pair of sequences (a; 6 ) G F 2 x F 2 , we let 

:= {(c;d) G F^ x F^ dx{{a-,b),{c-,d)) < r} (4) 

denote the set of pairs of sequences that are (r + 1, A)-indistinguishable from (a; b). We also use A\{n, d) to denote the largest 
number of (d, A)-distinguishable sequences of length n. 

Example 1. Let ((0,1,1); (1,1,1)) G F 2 x F^. Then, the following eight vector pairings 

(( 0 , 1 , 1 ); ( 1 , 1 , 1 )), (( 1 , 1 , 1 ); ( 1 , 1 , 1 )), (( 0 , 1 , 1 ); ( 0 , 1 , 1 )), 

(( 1 , 1 , 1 ); ( 0 , 1 , 1 )), (( 0 , 0 , 1 ); ( 1 , 1 , 1 )), (( 0 , 1 , 1 ); ( 1 , 0 , 1 )), 

(( 0 , 1 , 0 ); ( 1 , 1 , 1 )),(( 0 , 1 , 1 ); ( 1 , 1 , 0 )) 

are (3,1)-indistinguishable from ((0,1,1); (1,1,1)). For example, the ALD between ((1,1,1); (1,1,1)), the second pair in the 
above list, and ((0,1,1); (1,1,1)), the chosen pair, equals 1 -f A = 2, and this distance arises from the pair of symbols in the 
hrst positions, ( 1 ; 1 ) and ( 0 ; 1 ). 

On the other hand, there are seven pairs of sequences (3,1)-indistinguishable from ((0, 0, 0); (0,0, 0)), namely 

(( 0 , 0 , 0 ); ( 0 , 0 , 0 )), (( 1 , 0 , 0 ); ( 0 , 0 , 0 )), (( 0 , 0 , 0 ); ( 1 , 0 , 0 )), 

(( 0 , 1 , 0 ); ( 0 , 0 , 0 )), (( 0 , 0 , 0 ); ( 0 , 1 , 0 )), (( 0 , 0 , 1 ); ( 0 , 0 , 0 )), 

(( 0 , 0 , 0 ); ( 0 , 0 , 1 )). 

Note that the sum of the two vectors in the hrst example equals a + b = (0,1,1)-|-(1,1,1) = (1,0,0), and hence has Hamming 
weight one. On the other hand, the sum of the two vectors in the second example equals a + b = (0, 0,0) -f (0, 0, 0) = (0,0, 0) 
and has Hamming weight zero. 

Although the ALD depends on the exact values of its arguments a, 6 , we show next that only depends on the 

Hamming weight of a + b, where the sum is taken modulo 2. This claim parallels a result for the ^-channel, which asserts that 
the size of the error sphere is a function of the Hamming weights of the transmitted sequences HD. To draw upon this parallel, 
we subsequently denote the Hamming weight of a -f b by w{a;b). Clearly, w{a-,b) = dH{a,b), where dH{o,,b) denotes the 
Hamming distance between a and b. 

Lemma 1. Let (a; b) G F 2 x F 2 be a pair of sequences with weight w{a; b). Then, |S(r,A) (ct; b) \ depends on (a; b) only through 
w{a; b). 

Proof: From Q and Q, one can see that |S(r,A)(Q; ^)l equals the number of paired sequences (c;d) G F 2 x F 2 that satisfy 

n 

^(1 + A) (l(ai, bi) + l(ci, di)) + 

i=l 

AI (Ui , bi, Ci, di') ^(1 “f X)^(^CLi,bi, Ci, di)^ ^ T. 

First, assume that a and b are hxed and let E = {i ■. Oi = hi] and D = {i : Oi hi}. We are interested in the number of paired 
sequences (c; d) G F 2 x F 2 that satisfy 

y ' (1 + '^)(1 + l(ci, di)) — 2(1 + X)K^ai, Ci, di)+ 

ieE 

^(1 -I- A) l(ci, di) + Xl{ai,Ci, di) ^ r. 

ieD 
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The above inequality may be rewritten as 

n 

(i + a)E 1 (Cj , ) 2(1 ~1~ A) ^ ^ 1 1 di )“h 

2—1 iGE 

Ay^ l{ai,Ci,di) ^ r - (1 + A) |i;|. 

ieD 

The first term in the above sum does not depend on the choice of a. For i G E, independent on the choice of a^, the second 
sum provides a non-zero contribution if and only if Ui = Ci = di. For i G D, independent on the choice of Ui, the third sum 
provides a non-zero contribution if and only if Ui = Ci = di. Hence, it follows that the number of choices for c and d satisfying 
the inequality depends only on \E\ and \D\, and consequently on w{a-,b) only, as w{a;b) = \D\ and \E\ -|- \D\ = n. ■ 

Next, we turn our attention to deriving an expression for V\{n, w,r) := t>)\, with w{a; b) = w. Recall that ;B(r,A)(<^5 

was defined in 0 and let 

Sxin, w, r) := {(c; d) gW^ : dx{{a; b), (c; d)) = r}, 

with Sx{n, w, r) = |SA(n, w, r)|. Clearly, Vx{n, w, r) = w,j). The next lemma establishes a formula for Sx{n, w, r) 

which may be subsequently used to determine Vxin,w,r). For simplicity of notation, we use k*£*m*X = r to denote 
{2k -I- •^)(1 -I- A) -I- Am = r, and similarly, fc*£*m*A^rto denote {2k -I- €)(1 -I- A) -I- Am ^ r. 


Lemma 2. For positive integers n, w and r such that w ^ n, 

Sx{n,w,r) = 


E 


W 

m 


n — w 
k 


n — k — m 
£ 


2^. 


Proof: We count the number of sequences (c; d) C F 2 x F 2 such that dx{{a; b)\ (c; d)) = r, where w{a-, b) = w. Suppose 
that (c; d) is the result of m Class 1 errors, k Class 3 errors, and £ Class 2 errors occurring in (a; b). We first arbitrarily choose 

the locations of any of the Class 1 errors from among the w locations in (a; b) for which oi 7 ^ 61 . There are ^ ^ ^ choices for 

the locations. Similarly, we can choose the locations of the Class 3 errors amongst the n — w locations where oi = bi. There are 

Ti — VJ \ 

^ 1 choices for these locations. The locations of the Class 2 errors are chosen among the remaining n — k — m locations 

in (a; b), and this choice of locations is coupled with the choice of which of the 2 available bits is changed (recall that a Class 

n — k — m 


2 error causes a substitution error in one of the two input strings at any given location). There are 
for the Class 2 errors, which proves the statement in the lemma. 


2 ^ choices 


Corollary 3. For positive integers n, w and r such that w ^ n, 

Vx{n,w,r) = 


E 

k,i,m: k*£.*m*X^r 


W 

m 


n — w 
k 


n — k — m 


2\ 


The next corollary will be useful for the derivations in the sections to follow. 

Corollary 4. For positive integers n, w and r such that w ^ n, it holds that Vx{n, w,r) ^ Vx{n,w — 1, r). 
Proof: To prove the claimed result, consider the ratio 

h-rr, \ 


w 

m 


n — w 
k 


.-k-' 


w{n — w) 


w — 1 
m 


n — w — 1 
k 


n — k — m 


2 « 


Since w ^ w — m and n — w^n — w — k for all possible parameter choices, one has ^ 1- Given that 

r' r' ’ {^w—m){n—w—k) 

n—k — m 


Vx{n,w,r)= 


k*i*m*X^r 


W 

m 


n — w 
k 


{w — m){n — w — k) 

w{n—w) 
[n- 

2 ^ 


it clearly follows that r) ^ Thus, Vx{n,'w,r) ^ Vx{n,w — l,r), as desired. 


111. Upper Bounds from Hypergraphs 

In what follows, we focus our attention on non-asymptotic upper bounds derived based on a recent optimization on hypergraphs 
framework. We start our exposition by introducing the relevant notation borrowed from (131. 
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A. Hypergraph Preliminaries 

A hypergraph H is an ordered pair of sets {V,E), where V denotes the set of vertices and where E denotes the set of 
hyperedges, i.e., selected subsets of V. A matching in "H is a set of disjoint hyperedges, and we use to denote the size of 

the largest matching, i.e., the maximum number of disjoint hyperedges in H. The number is also known as the matching 

number of the hypergraph H. A fractional matching of a hypergraph is a function / with domain E and range [0,1] that assigns 
a value /(e) to each hyperedge e so that J2eeE{v) /(®) ^ ^ vertex v and its set of incident hyperedges E{v). If we 

restrict the range of / to { 0 , 1 }, the fractional matching reduces to the indicator function of a matching. Ahs fractional matching 
number of a hypergraph H, denoted by equals 


or equivalently 


= sup ^ /(e), 

/:/ a fractional matching 


\E\ 


"{'H) = sup < ^ Zj : M • z < l,z e 


i=l 



where M denotes the vertex-hyperedge incidence matrix of the hypergraph. 

A subset of the vertex set that has a nonempty intersection with every edge is called a transversal of the hypergraph H, and 
the smallest size of a transversal is called the transversal number of the hypergraph, denoted by Similarly as for the case 

of matchings, one may define di fractional transversal as a function g with domain V and range [0,1] that assigns a value g{v) 
to each vertex v so that J2vev{e) 5'(^) ^ ^ edge e and its set of incident vertices T^(e). If we restrict the range of 

g to {0,1}, the fractional transversal reduces to the indicator function of a transversal. The fractional transversal number of a 
hypergraph "H, denoted by equals 


= inf g{v), 

g.g 3i fractional transversal 

vGV 


or equivalently 


t*{'H) = inf 




Using duality arguments, it can be shown that v*{T-L) = 

Next, fix two positive integers r and A and recall the definition of from Q. We introduce the following hypergraph 

^(r.A).n = (F^ X F^, {%.A)(a; b) : (a; 6 ) e x F”}), 


with vertex set F^ x F^ and edges that represent subsets of vertices within ALD balls of radius r around some vertex. Then, the 
following result follows directly from In- 


Lemma 5. Given the hypergraph 'H{r,x),n, one has 

Ax{n, 2r + l) = v{U(r,\),n) < 'r*('H(r,A),n)- 

Intuitively, the lemma asserts the fact that a matching in the hypergraph corresponds to a packing of balls in the ALD metric, as 
no two hyperedges (balls) are allowed to intersect. Thus, the matching number is the largest number of disjoint balls that may 
be packed in the corresponding space, and hence it is equal to the largest size of an ALD code. 

We find the following reformulation of the aforementioned bound useful | [T3] . Let A(,r,A) be a square matrix with 4” rows and 
columns, indexed by the elements of F 2 x F^. We set A(j. >,)(//) = 1 if (a^i 1 ; Xi^f) G S(j-,A)(3^j,i; *^, 2 ), and zero otherwise, for 
any two pairs (at^ 1 ; {xjp\Xj, 2 ) G F 2 x F 2 . Observe that A(^r,\) is the adjacency matrix of the ALD r-confusion graph. 

Let K+ denote the set of non-negative reals. Then, 

= min I : in € : A.f^,x) • lu > l|. (5) 

For simplicity, we subsequently write r* (A(r,A)) to refer to t* {'H(r,x),n), as r* {'H(^r,x),n) is a function of A(^r,x) only. Note 
that (|^ is an LP involving 4" constraints, which may be prohibitively large to solve even for moderate values of n. We show 
next how to significantly reduce this number of constraints by using hypergraph symmetries |j^. 

Following the approach of |j^, define a directed graph G{r,x) on the vertex set F 2 x F 2 , with an arc from {xi; X 2 ) to (j/i; 1 / 2 ) 
iff (yi;y 2 ) g (a:i;a; 2 ). An automorphism tt of G{r,X) is a permutation in 84 ™ (the symmetric group of order (4")!) that 

operates on the vertex set and preserves adjacencies. The set of all automorphisms of G{r,x) is denoted by Aut(C/(j.,A)) and 
represents a subgroup of S 4 ».. The group Aut((/(r,A)) induces a relation R on the vertices of G{r,x) in the following manner; 
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II 

3 

5 

7 

9 

11 

13 

1 

3 

1 

1 

1 

1 

1 

2 

9 

2 

2 

1 

1 

1 

3 

30 

7 

5 

2 

2 

1 

4 

99 

21 

13 

5 

4 

2 

5 

336 

67 

35 

12 

9 

4 

6 

1161 

219 

101 

32 

21 

9 

7 

4080 

731 

296 

90 

51 

20 

8 

14535 

2483 

895 

258 

130 

50 

9 

52377 

8563 

2783 

771 

348 

127 

10 

190557 

29901 

8890 

2361 

966 

340 


TABLE I 


Bounds on ALD code sizes based on the LP in 


{x,y) G R iff there exists a tt S Aut(C/) such that 7 r(x) = y. Clearly, R is an equivalence relation. Hence, given a subgroup H 
of Aut(^(r,A)) of order uh, we may use H to partition the vertex set into uh equivalence classes {Vh,i, • ■ ■, VA.njj}■ 

With a change in the indices of the corresponding expression in @,let Aff be an uh x uh matrix indexed by the elements 
of the subgroup H, with entries A//j) dehned according to 


Xi;x2),iyi-,y2)) ■ ixi-x2) G Vh,j, (yi;y2) g VA,* n%,A)(a^i;a:2)}| 


^ j < nn- 


We interpret the matrix A// a) as follows: One is given nn equivalence classes, representing the indices of the rows and 
columns of the matrix. The entry in row i and column j is the normalized number of pairs (aii; X 2 ), (j/i; 2 / 2 ) ^'^oh that {xi;x 2 ) 
is in the equivalence class j, and {y^;y 2 ) is in a subset of the equivalence class i that comprises points in an r-ball around 
{xi;x 2 ). Normalization is performed with respect to the size of the j-th equivalence class, |Vf 7 j|. 

Using the ALD hypergraph symmetries, we can simplify the statement in Lemma and arrive at the following theorem which 
follows from the methods introduced in im. 


Theorem 6 . Let H be a subgroup of Aut(0(r,A)) let A^ (^.a) be as defined before. Then, 

{ riH 

^\VhA -w^-.wG K”", w^l 

i=i 

and Ax{n, 2r + 1) ^ T*(A(r_A))- 

We next identify a set of automorphisms of G(r,\} that allow us to apply Theorem to the case of ALD codes. For every 
permutation a — (cr(l),..., cr(n)) G S„ and for every 2 ; G F 2 , let 

X ^ ¥1^ x F^ ( 6 ) 

be a mapping such that for all (a; b) G F^ x F^, ( 7 r^,z(a; 6 ))i = (a^(i); b^(i)) if Zi = 0 and {TTa,z{a-,b))t = (d<^(i); b<^(j)) 
otherwise. It is straightforward to see that H = {tTo-.z : cr G S„,z G F 2 } is a subgroup of Aut(t7(r,A)): Coordinate permutations 
do not change the code, and complementation preserves the ALD symmetries of the pairings. 

The subgroup H partitions the space F 2 x F 2 into n + 1 equivalence classes, with (a;b),(c;d) G F 2 belonging to the 
same equivalence class if and only if w{a;b) = w{c;d). The equivalence class containing a pair of vectors a and 6 of weight 
w{a; b) = i has 2" (") elements. Consequently, 

T*(A(r^A)) = min 

w G M”+\ w^l 



By using the automorphism group, the dimension of the weight vector w reduces from 4", as given in to n + 1, as given 
in 0 - Solutions of the LP of 0, for n < 9 and odd dA=i ^ 13 are listed in Table I. 

To complete our analysis, we exhibit two feasible weight vectors for the optimization problem in 0 which lead to appropriate 
upper bounds on the size of ALD codes. 

Theorem 7. Let Va {n, w, r) = Va (n, 0, r) for w < 0. Suppose that n, r, X are positive integers, and set p, = [ J. Let the elements 
ofw = {wo, wi,..., Wn) G be given as 

Wi = 777——— 

Vx[n,i- p, r) 
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Then, Theorem^leads to 


A,(n,2r + l)^2--f^( ^ ) 

i=0 ^ ' 




Proof: The result follows directly from Theorem and (j^, provided that we can show that • tn ^ 1. For the given 

weight assignment, and for an arbitrary choice of an equivalence class VH,j and (a; 6 ) G Vhj, we have 


E 


{((a;b),(c;d)):(a;5)GVif,j,(c;d)GVH,in6(r-,x)(a;5)} 


'^ii;(c;d) 


E ^ 
= E 

(c;d)GB(,.,A)(a;t>) 

^ E 

(c;d)GB(,.,A)(a;t>) 

Vx{n,w{a-b),r) 


u{c;d) 


Vx(n,w(c; d) - p,r) 
1 

VA(n,w(a; 6 ),r) 

= 1 , 


V\{n,w{a-b),r) 

where the inequality follows from Corollary and the observation that the weight of any element in B(j. x^{a\b) is at most 
u>(a; b) + /r. ■ 

Note that the choice of weights w from the previous theorem produces the best possible upper bound on A\-r{'n,2r + 1) 
achievable via Theorem This can be seen by noting that for r = A, one may write 


t*(A(a,a)) =mini 2’" • ^ 


t=o 


■ Wi : 


wi ■ {i + 1) ^ 1,0 n,wt ^ 0 j. 

The bounds to follow are concerned with special choices of the problem parameters and do not involve solving LPs. 
Proposition 8 Suppose that r = \. Then 


^(A(,,a)) = 2 ".^ 


Q) _ 2 ”( 2 ”+i - 1 ) 


e=o 


£ + 1 


n+1 


Hence, Ax{n, 2A + 1) < 2"(2"+i - l)/(n + 1). 

Proof: Note that when r = X, AH,{r,x) is a diagonal matrix with entries ai^i = £ +1 for £ G {0,1,..., n}. Let lu be a vector 
with entries wg = (y^), £ = 0,... ,n. Then, A|^ • to = 1 and to is a feasible solution of the underlying LP. On the other 

hand, since A^ a) is symmetric. A// a) ■ w = 1 and in is a feasible solution of the dual LP. This proves the claimed result. 

■ 

The following proposition may be proved along the same lines as Proposition It relies on using sphere-packing arguments 
and a subgraph of the ALD confusion graph. 


Proposition 9 For positive integers r, A, 


Ax(n, 2r + 1) ^ 2" • y ^ 


'L/AJ 


J 


^J=0 

Proof: The proof follows from the simple observation that the vector w with entries 

( 1 


wt = 




[r/AJ 


f =0 


£ = 0,... ,n, 


satisfies A^ w ^ 1, and is hence a feasible solution of the linear program. 
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Consider next the special case when A = 1, and r ^ 2. To ease the notation, we introduce a function K{n,r,i,j), such that 

LiJ 


K{r 


e=o 


n — I 

j -i + ‘ 


■2^ 


-i+t 


E 

4/c+m^r —2(j —2+2^) 


n- j -i 
k 


for j ^ i, and 


LiJ 

K{n,r,i,j) 

i=o 


i-j + 

j -f- 


i-i 

m 




m 


n — i — i 
k 


E 

4:k-\-m^r—2{i—j-\-2£) 

for j < i. We also assume that K{n,r,i, j) = 0 whenever r — 2\i — j\ < 0. Given this setup, we may write = 

j=v where Uij = K{n,r,i — l,j — 1). The above claim follows from the observation that 

K{n,r,i,j) = ||(yi;y 2 ) : = j, (yi;y 2 ) e ^(r,i)(a:i;a^ 2 )| 

where w{xi-,X 2 ) = i. To arrive at the expression for the above set-cardinality for the case j ^ i, one has to observe that one 
may choose £ locations for Class 2 errors from among the i locations where Xi and X 2 differ in value; one may then choose the 
locations of the remaining j — ii — f) Class 2 erTors from among the n — i locations where Xi and X 2 are equal in value. From 
the remaining locations, we may choose the locations of k Class 3 erTors from among positions in which the vectors agree in 
value, and m Class 1 errors from among positions in which the vectors disagree in value. Similar reasonings apply for the case 
that j < i. 

Next, we produce a weight assignment for (j^ by considering another matrix A(n,r) related to A^ that takes advantage 
of the structure of The basic idea is to change some values of A^ so as to ensure that the diagonal elements of 

A(n, r) are sufficiently large in order to invoke certain results regarding positive solutions of linear equations. The weights are 
then given as w = A{n,r)~^ ■ 1, and Theorem [TT| establishes that w = A{n,r)~^ • 1 is a feasible weight assignment for Q. 
The matrix A(n, r) is constructed as follows: let A(n,r) = Then, 


ttij = min I 


(^K{n,r,i- l,j - 1), ^{K{n,r,i- l,z - 1) - 1)^ 


for j ^ i, and Ui^i = K(n,r,i — l,i— 1). We hnd the following lemma useful for the proof of Theorem 
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Lemma 10. (c.f. lOJ) Suppose that the matrix M = {rriij) G contains no zero elements on the diagonal, and let b > 0 be 


chosen arbitrarily. If for all * S {1,..., n}, 


bi> ^ rriij^- 
TTLn 


then M is invertible and M ^6^0. 

Theorem 11. For any two positive integers n, r, we have 

n 

Ax=i{n,2r + l)^2^ -Y, 

e=o 




where w = {wq, Wn) = A(n, r) ^ • (1,1,..., 1). 

Proof: First, note that by the dehnition of the matrix A(n, r), for all i S {1,..., n + 1}, one has 


n+1 


n _• . 


i = l j/i 


< 


*+L5J 

E 


K{n,r,i — \,i — 1) — 1 
rKiji, r,i — l,i — 1) 


< 1 . 


Based on Lemma 


10 


there exists a w with non-negative entries such that A(n,r) • m = 1. By definition, we also have 


Afj ^ A{n,r) and since u; ^ 0, it follows that 


^ A(n, r) ■w = l 
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Length 

Bound from 

Theorem f/] 

Proposition!^ 

Theorem [Til 

5 

6 

67 

219 

427 

1079 

254 

793 

112 

354 

7 

731 

2750 

2508 

1170 

8 

2483 

7181 

8048 

3793 

9 

8563 

19485 

26190 

12008 

10 

29901 

55529 

86393 

40400 

11 

105490 

166902 

288649 

139753 

12 

375448 

527725 

975954 

486642 

13 

1346201 

1742275 

3336118 

1665472 

14 

4858171 

5949948 

11518362 

5679816 

15 

17631726 

20833123 

40130869 

19999983 


TABLE II 

Comparison of Upper Bounds for Ax=i (n, 5) 


as desired. Thus, since w 1 and in ^ 0, to is a feasible solution for the program in 

The results of our bounds on Ax^i{n, 5), when n ^ 15, are listed in Tabl e [n| In general, the bound from Proposition [^provides 
the weakest upper bound on the code size while the bound from Theorem 11 best approximates the optimal weights solving the 
LP (0. Notice that the bound from Theorem]^ is weaker than the bound from Theorem [TT] for all code lengths considered. 

In the next section, we consider another approach for deriving upper bounds on A\{n, d). The method we use is a combination 
of novel symbol mapping techniques and the classical Delsarte-like LP approach. As will be shown by computer simulation, 
Delsarte-like bounds tend to outperform hypergraph methods for ALD codes for a number of parameter settings. 


IV. Upper Bounds erom Delsarte-Like Inequalities 

We focus next on classical LP techniques for bounding the number of (d, A)-distinguishable sequences A\{n,d). The gist of 
our approach is to map the symbols from F 2 x F 2 into symbols from some sufficiently large alphabet for which the distance 
constraints may be captured in a unique manner. We then proceed by introducing the complete distance enumerator for ALD 
codes, and use these enumerators to dehne an appropriate LP. 

In order to explain why we need to map the symbols of F 2 x F 2 into symbols from a larger alphabet, consider the following 
natural mapping onto Z 4 : 

(0;0)^0, (0;1)^1, (1;0)^2, (1; 1) ^ 3. 

Since 2 — 1 = 0— 3 mod 4, one cannot distinguish between the binary pairs (1; 0), (0; 1) and (0; 0), (1; 1) based on the difference 
of their images. To insure distinguishability of the differences of the images, one may consider using a mapping from F 2 x F 2 
into F 2 X F 2 X F 2 . Unfortunately, this mapping results in poor bounds, due to the fact that many of the LP constraints derived 
from this mapping are redundant. 

Consider another mapping : F 2 x F 2 —>■ Zio given by 

(0;0)^0, (0;1)^1, (1;0)^9, (1; 1) ^ 5. 

This mapping and the pairwise differences of the images are depicted in Figure]^ For any positive integer n > 1, we extend the 
dehnition of the mapping : F 2 x F 2 —>■ Z"q in a natural manner as b) = 



Fig. 5. The map 0^0) xhe labels of the edges equal the difference of their defining vertices, taken modulo ten and reflected to lie in the interval [—5,4]. 

Let X G Z"g. Dehne the prohle of x, denoted by P{x), as (mx)xGZio 7 where = |{* : Xi = a:}|. For example, the prohle 
of a: = (5,1,4,1) G Zfg equals P(x) = (0, 2, 0, 0,1,1,0, 0, 0,0). With each x € Zio, associate an indeterminate z^- Given a 
collection of words C C F 2 x F 2 , dehne the complete distance enumerator of C as 

Wc(zo,Zi,...,Zg) = 

'wlmo,mi,...,mg]z!P°z^^ ■■■zp, 
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where 

w[mo,mi,... ,m 9 ] = |^|{(a; 6), (c; d) eC : 

- (j)^'^°'>{c;d)) = {mo,mi,... ,mg)}\, 

and where the differences are applied componentwise and evaluated modulo ten. 


Claim 1 LetC C x with d\{C) ^ d have a complete distance enumerator mg i ^ 9 ]^™° ’ ’ ’ ■ Then, the 

following claims are true: 

(i) \C\ = 'Yhmo mg • I ”^ 9 ]; which states that the number of codewords may be retrieved by setting all variables in the 

complete weight enumerator to one; 

(ii) w[n, 0,..., 0] = 1, which follows from the definition of the complete weight enumerator; 

(Hi) w[mo,..., mg] = 0 whenever m 3 > 0 orm^ > 0, which ensures that the edge labels ±3 are not allowed; 

(iv) u>[mo,..., mg] = 0 if (1 + A)(mi + mi +m6 + mg) + A(m 2 d-ms) + 2(1 + A)m 5 < d, which captures the minimum distance 
constraint. 

Let 

X(x, y) = exp(-27rt {x^ ■ y)/10), 

where t denotes the imaginary unit, x,y e I/I 3 , and x'^ y = Clearly, x(a;,y) = nr=i 2/*)- For x G 

suppose that P{x) = {mo,..., mg) and dehne z{x) = J([j 


Lemma 12. Fix x G Z"g and suppose that P{x) = (mg,..., mg). Then 

X! ^iy)x{x,y) = Fz{mo,mi,...,mg), 


y&i 


where 


9/9 N 

Fz{mo,mi ,... ,mg) = HE ZtX{i,j) 

j—0 Vi—0 / 


( 8 ) 


that 


Proof: The claim follows by induction. The base case n = 1 can be easily verified, while for n ^ 2, it suffices to observe 


X! ^{yi^y^)x{{xi,x2),{yi,y2)') = 


(y-i.yg) 


Theorem 13. LetC C F 2 x F 2 and as before write 

^ w[mo, ...,mg]z{f° 

for the complete distance enumerator ofC. Define b[mo ,..., mg] via 

w[mo,...,mg]F^{mo,...,mg) = 

b[mo,... ,mg]z{f° ■ ■ ■ z{p^ 

Then b[mo,..., mg] ^ 0, for all mo,..., mg. 


Proof: For ease of notation, let x,y gC denote two codewords (a:i;a; 2 ) and {yi;y 2 ), respectively. Consider the following 
expression: 

E ^i'^)x{u,f^^°\x)-())^^°\y)). 

x,y£C 


On the one hand, using Lemma 12 we may rewrite the given expression as 

ici E w[mo,.. .,mg\F^{mo ,... ,mg). 


mQ,...,mg 
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n\dx^l 

I 2 3 

4 

5 

6 

7 

8 

9 

to 

II 

12 

13 

14 

15 

16 

17 

18 

19 

20 

I 

- - 2 

2 

















2 

_ _ _ 

- 

2 

2 

2 

2 













3 





4 

4 

2 

2 

2 

2 









4 







4 

4 

3 

3 

2 

2 

2 

2 





5 









6 

6 

3 

3 

2 

2 

2 

2 

2 

2 


TABLE III 

Results of the Delsarte LP approach for A = 1 


n\dx=l 

I 2 3 

4 

5 

6 

7 

8 

9 

10 

II 

12 

13 

14 

15 

16 

17 

18 

19 

20 

I 

- - 3 

3 

















2 

_ _ _ 

- 

2 

2 

2 

2 













3 





5 

5 

2 

2 

2 

2 









4 







5 

5 

4 

4 

2 

2 

2 

2 





5 









9 

9 

4 

4 

4 

4 

2 

2 

2 

2 


TABLE IV 

Results of 0 for A = 1 


On the other hand, by switching the order of summation, one arrives at 

x,yGC 


u^Z'^Q x.yGC 


= i'^x{u,-(l)^'^°\y)) 

uGZ^q \xGC / \y^C i 


= 51 ^(“)| 




x^C 


The proof follows since 




> 0 . 


As a consequence of the above results, an upper bound for |C|, with d\(C) given, may be obtained from the following LP; 


maximize 5 ^ ..., mg], subject to 

mo,...,mg 

u;[n,0, ...,0] = 1, 

w[mo ,..., mg] = 0, if m 3 > 0 or my > 0, 
w[mo ,..., mg] = 0, if (1 + A)(mi + 1714 

+ mg + mg) + A(m 2 + mg) 

+ 2(1 + A)m5 < d, 

mo,...,mg 


The multivariate polynomial inequality is to be interpreted as a nonnegativity constraint on the coefficients of the polynomials. 

As can be seen from Tables m and |IV| for a number of cases where d ^ 2n + 1, the bounds obtained from Delsarte’s method 
are tighter than the bounds obtained via the LP techniques of Section III For the case d < 2n + 1, the output of the LP from 
Theorem fOl was unbounded. 


V. Lower Bounds from Linear Code Constructions 

Throughout the remainder of the paper we present constructive lower bounds on the sizes of codes under the ALD. As a 
starting point, we focus mainly on the case where A = 1 although many of the constructions can be extended to other parameter 
settings. Also, for simplicity, we first address the case where the minimum ALD is equal to three. 

For a positive integer v, let Lfo.i € F 2 be a matrix which has as its columns all non-zero vectors from F 2 , except for 

the all-ones vector. Write iFo 1 = {hi, hg ,..., where for i S {1,..., 2^^ — 2}, hi represents the i-th column of TTo.i- 

Furthermore, let 1„ G be the all-ones vector. 
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In this setting, define C;(2'" — 2) C F2 ^ x F2 ^ as 

Ci(2" - 2) :={(a; b) e Ff x Ff : 

2^-2 2’'-2 

^ ^ • hi ^ ^ ^ bi • lu — 

i=l i=l 

where (a; b) = ((oi,..., a2-_2); (^i, • • ■, &2’'-2))- 

The subscript “Z” in the definition of the code Ci(2'" — 2) serves to indicate that the code is a linear code. From the introductory 
discussion, observe that a code with A = 1 and minimum ALD distance three can either: 

1) Correct a single Class 1 error, or 

2) Detect a single Class 2 error. 

In the following lemma, we show that the above defined code C;(2’' — 2) can perform either 1) or 2). 

Lemma 14. For any positive integer n, dx=i{Ci{2'" — 2)) ^ 3. 

Proof: Let n = 2'" — 2. Suppose that (a; b) S Ci(2'" — 2) was transmitted and that the pair of words (c; d) G F2 x F2 
was received, where (c; d) is the result of at most one Class 2 error occurring in (a; 6). For the remainder of this proof, let 
s = X]r=i Ci • + X]r=i denote the decoding “syndrome”. If a single Class 2 error occurred in position j G {1,..., n} 

of a, then s — hj 0. Otherwise, if a Class 2 error occurred in position j of 6, then s = 7^ 0. Clearly, if no Class 2 errors 
occurred, then s = 0. Thus, Ci{2'" — 2) can detect whether a single Class 2 error has occurred by checking if s is non-zero. 

Suppose that (a; b) G C/(2'" — 2) was transmitted and that the vector (c; d) G F2 x F2 was received, where (c; d) = 
((ci; di), (c2; df), ■ ■ ■, [cn] dn)) is the result of at most one Class 1 error in (a; b). We describe next how to recover (a; b) from 
{c;d). Let s = s -f 1„, where s denotes the previously defined syndrome. Note that if (c; d) is the result of a single Class 1 
error in position j of (a; b), then 

C n n \ 

^ ^ 0% ' hi -f ^ ^ di ' 1^ j “t“ 'i-v — hj T 1.y -f 1^ — hj . 

i=l i=l / 

Otherwise, if no error occurred, s = s + = 1„. From the above discussion, and by recalling that hj 7^ by construction, it 

is clear that a decoder for Ci{2'" — 2) can recover (a; b) from (c; d), and (c; d) from 5 as follows. If 5 = 1„, then the decoder 
concludes that no errors have occurred. Otherwise, if s = hj for some j G {1,..., n}, then the decoder corrects a Class 1 error 
at position j. ■ 

For a tt G F2, let — 2) C F2 x F2 be a linear coset of Ci{2'" — 2) so that 


C,,„(2" - 2) :={(a; b) G Ff"" x Ff-^ 


2^-2 


■2 ^ 2 
2^-2 


^ ^ ' hi T ^ ^ bi ' i-y — It}. 


(9) 


The next result follows directly from Lemma 14 It will be used in the following section. 


Corollary 15. For any positive integern and u G F2, dA=i(Ci,u(2’' — 2)) ^ 3. 


As a consequence of Lemma and Proposition we have 


4" 

—^ < AA=i(n,3) ^ 
n + 2 


2”(2”+^ _ 2) 
n + 1 


We now turn our attention to the problem of constructing codes with A = 1 and minimum ALD equal to d, where d > 3 is 
arbitrary. We first describe the code construction, and then proceed to provide a proof for its correctness. 

Let G F2^^" be a parity check matrix for a code C with Hamming distance d. Write Hd = {h[,... ,h 2 n), where h[ 
denotes the z-th column of Fid, and define CL{n) C F2 x F2 according to 


Crin) :={(a; b) = ((ai; 61 ),..., (a„; 6„)) G F^ x F^ 

n n 

• {h[ + h'„^i) = o|. 

i=l i=l 

Similar to before, the subscript “L” on Crin) refers to the fact that this is a linear code, and in order to distinguish it from 
the distance-three code Ci, we used the capital letter L. 
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We have the following result. 

Lemma 16. For positive integers n and d ^ 4, d\=i{CL{n)) ^ d. 

Proof: Let (a; b) G CL{n) for d ^ 4. Since (a; b) G Crin), ’ (^i + ^n+i) = 0- Therefore, if 

(c; d) G Ci(n), then X]r=i c-i- h^ + + ^n+i) = 0 as well. By combining these two equations, we deduce that given 

(a; b) G Crin), (c; d) G Crin) if and only if 


+ Ci) • b.' + ^(bi + di) • {h[ + b.^_|_j) — 0. 


( 10 ) 


It remains to show that for any (c; d) G F2 x F2 where dA=i((a; b); (c; d)) < d, equation (lOi does not hold. Suppose, 
in particular, that (c; d) is the result of ei Class 1 errors, €2 Class 2 errors, and €3 Class 3 errors occurring in (a; b), where 
Cl + 2e2 + 463 < d. Then, 

n n 

+ Cl) • b' + ^(bi + di) ■ {h[ + 

i=l i=l 

can be written as a sum of at most ei + 2e2 + £3 columns of Hd- Since ei + 2 e 2 + ics < d, and consequently ci + 2 e 2 + £3 < d. 


and since Hd is a parity-check matrix of a code with Hamming distance d, we conclude that (lOi does not hold. Therefore, 
Er=i Ci • K + ELi ■ (b' -f f 0 and so (c; d) Crin). ■ 

We now comment on the size of the code CL{n) for the case when dA=i = 5. More precisely, we compare the ALD codes 
Crin) with distance five to codes constructed for the Hamming distance. In particular, we focus on (a) binary codes of length 
2n; and (b) the binary image of quaternary codes of length n. 

For (a), since a code with minimum ALD equal to five must be able to correct four errors (resulting from two Class 1 errors), 
we require a binary code C that has minimum Hamming distance at least nine, so that one has 

c)2n 

|CK 


l^j=0 


2 n 

j 


which follows from the sphere packing bound. For 2n = 2" — 2, where u ^ 2 is a positive integer, 

4" 

^ (2n-f 2)2’ 

provided that one-bit shortened primitive binary BCH codes of minimum Hamming distance 5 are used as defining codes Hd- 
In this case, \C\ ^ |CL(n)|. Proposition for the case A = 1 and distance five, and the lower bound on |Ci(n)| imply that for 
2 n = 2 '" — 2, with u > 2 an integer, it also holds that 

4" 3-2” 

- =‘) ■ 

Next, consider the second case, (b). A quaternary code C with minimum ALD distance five has to have Hamming distance at 
least five. Writing out the sphere packing bound for quaternary codes with Hamming distance five, we arrive at 

4" 


1^1 ^ y^2 


i=o \j 


)3i 


For large enough n, this value is strictly smaller than (271+2)^ ’ which is a lower bound on \CL{n)\ for d\-i = 5. More precisely, 

4" 4" 




< \CL{n)\, 


whenever v ^ 5. Hence, the previously proposed constructions also outperform the “direct approach” with a quaternary alphabet. 


VI. Lower Bounds from Non-Linear Code Constructions 

In what follows, we present constructions of non-linear codes under the ALD. In many instances, the new code constructions 
result in codebooks larger than their linear counterparts. We first consider the simplest case when d = 2 and A = 1, and codes that 
can detect one single Class 1 error. Then, we extend our ideas to the case where the minimum ALD of the code is equal to three 


and A = 1. For this parameter case, we improve upon the construction in Lemma 16 whenever the block length is n = 2’' — 2, 
for any positive integer u ^ 5. General code constructions are given at the end of the section. 


Recall that the weight of a vector pairing (a; b) = ((ai, 


7);(b: 


1)• ■ •) 


b„)) G F2 X F2 is defined as w(a;b) = |{i : 


f- bz}I. Let S (ct, b) — (nji ? ■ • ■: 


V™(a„b) 


) be a subsequence of a obtained by deleting all the entries of a on which a and 
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b agree. In other words, if S{a; b) = (a^^,... ^j), then ji < j2 < • • • < jw(a,b) is the largest set of indices for which 

Qji ^ bj^, ^ bj^,..., ^ b)■ Furthermore, we denote the set of indices supported in 5 (a; 6), i.e., the set of indices 

for which a and b differ by Supp(a; b) — {ji,j2, ■ ■ ■ ,jw{a.b)}- Furthermore, for a set U = {ui, M2, • ■ •, Um} C n}, we 

let (a; b)u = .. .,au^)\ (6^1,^^2, • ■ ■,bum)) G x F^, where mi < M2 < • • • < m^. 

Dehne Cp{n) C F2 x F2 according to a parity and a weight constraint 

n 

Cp{n) :={(a; 6) G F2 x F2 : w{a; b) > 1, ^ at = 0 mod 2} 

U {(a; 6) e F” X F” : w{a-, b) = 0}. 

The subscript “p” in Cp{n) serves to indicate that the underlying code enforces a simple parity-check constraint. As before, let 
dH{x,y) stand for the Hamming distance between two binary vectors x,y G F^. Equipped with this notation, we may state the 
following result. 

Proposition 17. Let the code Cp{n) be defined as above, with n ^ 1. The code has £?>=! = 2, and 

\Cp{n)\ = 22"-i + 2"-i. 

The code Cp{n) has optimal code size for the given minimum ALD and code length. 

Proof: Consider an arbitrary code C of length n and with dA=i = 2. We partition the codewords in C into subsets of the 

form 

Ct,v ■= {{a;b) G C : Supp(a; b) = T, 

(a;6)[„]\T = 

where T ranges over all subsets of and where v G is arbitrary. We observe that a word (a; 6) G Ct,v is 

completely determined by a. In particular, bj = Uj + 1, if j G T, and bj = aj otherwise. 

If T is empty, then \Ct.v\ ^ 2" for all v. 

We show next that for any non-empty T and any m G F 2 \Ct,v\ ^ 

Let fc G r be an arbitrarily chosen element. We proceed by considering the following set; 

c't.'v ■= {(“; b)rp\k '■ (a; b) G Ct,v}- 

We claim that \C'rp .^\ = Suppose that on the contrary, \C'j. .^\ < |Ct.i,|. Then, there exist two codewords (a; b), (c;d) G Ct,v 

such that (a; h){i,...,n}\fc = (c; d){i,...,n}\fc- In other words, (a; b) and (c;d) only differ in one single position k. Furthermore, 
given that k G T, it follows that d},=i((a; b), (c; d)) = 1, which contradicts the original assumptions as (a; b), (c; d) cannot both 
belong to C. Hence, if T is non-empty, \Ct,v\ = \Ctv\ so that 




2 ” = 2 ^ 


Note that Cp{n) can detect a single Class 1 error, since a single Class 1 error changes the parity of a for any codeword in the 


first subset of Cpln). Hence, dx^i{Cp{n)) ^ 2 and given that simple counting arguments show that |Cp(n)| = 2^ 


the 


result follows. ■ 

The basic idea behind Proposition [T7| was to use two sets, {{a;b) G F2 x F2 : w{a;b) ^ 1} and {(a;b) G F2 x F^ : 
w{a; b) = 0}, the hrst of which imposes a parity constraint that increases the ALD. The second set was used to enlarge the 
resulting codebook. We next pursue a similar idea which relies on partitioning F2 xF^ into subsets that satisfy different constraints 
that boost either the ALD distance or the code size. 

Our next immediate goal is to construct a family of codes C(n) with n = 2" — 2, where m ^ 5, that have minimum ALD at 
least three, and size 

An 

\Cin)\^—+ 2 -. 
n + 2 

Recall that a code with minimum ALD equal to three, for A = 1, can either (a) correct a single Class 1 error or (b) detect a 
single Class 2 error. We consider the following two sets of paired words Vi and V 2 , where every (a;b) G Vi has the property 
that w{a; b) ^ 7 and w{a; b) = 1 mod 2; and every (a; b) G V 2 has the property that M;(a; b) > 9. 

Let Cnii) C F^ denote a binary code of length i with minimum Hamming distance three. Recall the dehnition of the set 
iS(a; b) from the introduction of this section and let 


Ci(n) :=|(a; b) G Vi s.t. 5(a; b) G CH{w{a; h))|, 
C2{ri) := |(a;h) G V 2 


and 
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for C; u as defined in (|^, and for some m € F^. 
Furthermore, define C{n) C F 2 x F 2 as 


C(n) := Ci(n) U C 2 {n). 


Lemma 18. For any positive integerv, d\^i{C{2'" — 2)) ^ 3. 

Proof: Let n = 2^ — 2. We show that C{n) has minimum ALD three by demonstrating that C{n) can either correct a single 
Class 1 error or detect a single Class 2 error. 

We start by establishing that C{n) can correct a single Class 1 error. Suppose that (a; 6) G C{n) was transmitted and that 
(c; d) G F 2 X F 2 was received, where (c; d) is the result of at most one Class 1 error occurring in (a; b) at some position 
j G Supp(a;6). Note that d//(5(a; 6), 5(c; d)) ^ 1. As a result, if w{c;d) = i ^ 7, one can use the decoder for CH{w{c;d)) 
to correct the Class 1 error. Otherwise, if * ^ 9, one can use the decoder for Ci^u{n) to correct the Class 1 error. 

Suppose that (a; b) G C{n) was transmitted and that (c; d) was received, where (c; d) is the result of at most one single 
Class 2 error occurring in (a; b). Assume first that w{a\ b) = i ^ 7. Since w{a; b) = 1 mod 2, we have w{c-, d) = 0 mod 2 if 
a Class 2 error occurred, and w{c; d) = 1 mod 2 otherwise. Thus, we can detect a Class 2 error whenever w(a; 6 ) = i ^ 7. If 


w{a; b) ^ 9, the result follows from the arguments presented in Lemma 14 
The next lemma provides a bound on the cardinality of C(n). 


Lemma 19. Forv ^ 5, |C(2’' -2)\^ 2^. 

Proof: Using an averaging argument, one can prove that 


ICiHI ^2’^ 


/cG{1.3,5,7} 


( 11 ) 


More precisely, the claim follows by observing that a Hamming code of length i and its coset codes partition the space F^; each 
of the i + 1 coset codes has minimum distance three. Therefore, there exists binary codes , of lengths 1,3, 5, 7, 

respectively, each with minimum Hamming distance three, so that 


for iG {1,3,5,7}. 

i + 1 


From i G (1,3, 5, 7}, it is straightforward to derive a code Ci(n) C Vi whose cardinality satisfies (11 1 . 
Using a similar reasoning, there exists a G F 2 in the definition of C 2 {n) for which 


K 2 M\ > 2 " • so that 

' ^ ^ n + 2 


fc=9 


icHi^2’^.( y] ^+E 

,fcG{l,3.5.7} fc=9 



ECU S 




(n + 2 )({ 


fcG{l,3,5.7} 



n +2 n +2 


B{n), 


where 




fcG{l,3,5,7} 

Since B{n) — (n + 2) > 0 for n ^ 30, the result follows. 


/c=0 


k ' 
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We consider next the case d > 3, with A = 1. The basic idea behind the construction is to map the symbols from F2 x F2 to 
{0,1, 2, 3} and then use codes in the Lee metric with proper modifications. The map of interest : F2 x F2 —{0,1, 2, 3} is 
defined as follows: 


(^W(0;0)^0, 

As before, for a word {a;b) G F2 x F2, the map is extended in the natural manner according to (j)^^'>{a]b) = (ai; bi), 

..., bn))', the image of a set under the extended map is the set of images of elements of the set under the map 

I - 

Let q be an odd prime and suppose that £ and d ^ 2 are positive integers. Let u G Zd and 2; G F^^ . Furthermore, suppose 
that a is a primitive element of Then, for n = — 1, define CN{n, u, z) as follows: 

CAr(n,M,2) := {(a;b) gF” xF” : (12) 

71 

= u mode? 

71 

'^(t)^^\ai',bi) ■a'- = zi 
2=1 


with all operations performed over ¥gi. 


= zid/2\}, 

2=1 


1 — I 

hemma 20. Let q be an odd prime where q ^ d+l.Forn = q^—1, dan odd positive integer, u G Zaandz G , d;^^i(Cjv(n, u, z)) ^ 

d. 

Proof: Suppose that (a; 6), (c; d) G Cjv(n, u, z). For simplicity, let Supp(a; b) stand for [n] \ Supp(a; b). Furthermore, let 

I = \{mG Supp(a; b) n Supp(c; d) : Om Cm}\, 

J = \{m G Supp(a; b) n Supp(c; d) : Um ^ Cm}|, and 
K — I G [ti\ . (Urn! bni ) ^ (Cm, dni)'^ \ (T J) • 

In order to show that dA=i((a; b), (c; d)) ^ d, we need to prove that I + 2K + 4 J ^ d. Clearly, if over F^ the Lee distance of 
(j)^'^\CN(n,u, z)) is at least d, then 

/ + 2Fs: + 4 J ^ J + 2Fs: + 3 J ^ d 


and the result holds. 

To see that the Lee distance of CN{n,u, z) is at least d, notice that we can recover any error vector of weight at most [|J 
from the power sums listed in (121. More precisely, if the Lee weight of (e; /) is at most [|J, where (e; /) denotes the error 
vector, it is known from 1 181 that given {^i\ fi) ' ^ for A: = {0,1 ,..., [|j}, one can uniquely determine the vector 


/). Clearly, from (12i, we have the values of (®*i /*) ’ rr* ^ for fc G {1,..., }. For k = 0, we can uniquely 

determine fij as by assumption, the Lee weight of f) is at most [|J. ■ 

Using an averaging argument that exploits the fact that the code from (12i partitions the space into d{n + sets of 

codewords, we arrive at 

4" 

\CN{n,u,z)\ ^ 

which for df: 7 improves the lower bounds from the previous section. Recall from Section]^ that one alternative to using codes 
in the ALD is to construct codes in the Hamming distance over F4. For instance when d = 3, a code with minimum Hamming 
distance 3 can correct any erTor with Hamming weight one and so it has codewords which are at distance at least 3 under the 
ALD. Since the largest code of distance d = 2r + 1 in the Hamming metric over F 4 has cardinality -^ we see that 




, 33 - 


J 


for d ^ 9, and n large enough the quantity in ( |13| l is larger than the largest possible code with Hamming distance d over F4. 
From Proposition [9j we also conclude that 

71 /r}\ 

< AA(n,2r + l) <2"-^ 


d{n + 1)’ 


^7=0 
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\c^A = l 

3 

5 

7 

1 

1-3 



2 

2-9 

1-2 

1-2 

3 

6-30 

1-7 

1-5 

4 

18-99 

3-21 

1-13 

5 

57-336 

6-67 

1-35 

6 

196-1161 

17-219 

2-101 

7 

683^080 

52-731 

5-296 

8 

2428-14535 

162-2483 

13-895 

9 

8739-52377 

525-8563 

38-2783 

10 

31776-190557 

1734-29901 

113-8890 


TABLE V 

Comparison of the upper bound based on the LP in 0 (VALUE ON THE RIGHT OF THE DASH LINE) WITH ([T^ (VALUE ON THE LEFT OF THE DASH 

LINE). The relative gap increases WITH INCREASING ALD. 


where d = 2r + 1. Note that the gap between the upper and lower bounds is a function of r as well as n. Table V compares the 
lower bounds of ( [T^ with the LP bound of Q. 

The last construction we present may be used to generate codes under the ALD for a large range of parameters. In particular, 
although our focus is on codes with integer A values, we observe that the construction in question may also be used to generate 
ALD codes with 0 < A < 1. 

Let CMin, d) be a code over F3 of length n with minimum Manhattan (£ 1 ) distance d. Similarly, let Cnin, d) denote a binary 
code of length n and minimum Hamming distance d. Define 

Cxin,d) ■.={{a;b) G {0,1}” x {0,1}” : 

(oi + &i, 02 + 62, • ■ ■ , On + bn) G CMin, {-—^—-] ), 

i + A 

Sia;b) G CHiwia;b),\j'\)}, (14) 

where addition is performed over the reals. We have the following lemma. 


Lemma 21. Given integers n and d, one has d\(C\(n, d)) ^ d. 
Proof: Suppose that (a; b), (c; d) G C\{n, d), and let 


I =\{mG Supp(a; b) n Supp(c; d) ■. Um Cm}\, 

J = \{m G Supp(a; b) n Supp(c; d) : Qm Cm}\, and 
K — I {tTI, G [rz] . (Orn ; bni ) (Cni ; dni)\ \ (T J) • 
Recall that if d\iia; b), (c; d)) ^ d, we need to show that 


A • / + (1 + A) • a: + 2(1 + A) • J > d. 


If (ai +bi,...,am + bm) (ci + di,... ,Cm + dm), then AT + 2 J > from (14i. Thus, 

d ^(1 + A) ■ (AT + 2d)=(l + A) • A” + 2(1 + A) • d, 

which proves the claimed result. Otherwise, if (oi + bi,..., Qm + bm) = (ci + di,..., + dm), the result is an immediate 
consequence of ([T4|). ■ 


VII. Conclusion 

We introduced a novel distance measure suitable for modeling the confusion graph of a DNA-based data storage readout 
channel. The distance, termed the asymmetric Lee distance (ALD) is related to the Lee distance, but refines the Lee distance 
in terms of allowing symbol weightings and asymmetries in the symbol distances. We derived upper bounds on the largest size 
of ALD codes using LP-based techniques, and provided linear and non-linear code constructions for some choices of symbol 
weighting parameters. 
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